BRLMM: an Improved Genotype Calling Method for the GeneChip® Human Mapping 500K Array Set

ثبت نشده
چکیده

Highly accurate and reliable genotype calling is an essential component of any highthroughput SNP genotyping technology. The Dynamic Model (DM, [1]) which has been extensively used for the GeneChip® Human Mapping 100K Array Set and the GeneChip® Human Mapping 500K Array Set has proven to be very effective, however it is possible to do better. Rabbee & Speed recently developed a model called the Robust Linear Model with Mahalanobis distance classifier (RLMM, pronounced ‘realm’) which provided an improvement over DM on the Mapping 100K set [2,3,4]. We present here an extension of the RLMM model developed for the Mapping 500K product which provides a significant improvement over DM in two important areas – it improves overall performance (call rates and accuracy) and it equalizes the performance on homozygous and heterozygous genotypes. The difference between RLMM and this approach is the addition of a Bayesian step which provides improved estimates of cluster centers and variances, the new model is called BRLMM (pronounced ‘B-realm’).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BRLMM-P: a Genotype Calling Method for the SNP 5.0

Highly accurate and reliable genotype calling is an essential component of any highthroughput SNP genotyping technology. BRLMM, the method of choice for the Mapping 500K product, is effective, but requires the presence of mismatched probes (MM) probes on the array to create “seed” genotypes. We present here a method that only uses perfect-match probes, BRLMM-P. The primary difference between BR...

متن کامل

Model Based Probe Fitting and Selection for SNP Array

Recent advances of high-throughput SNP arrays such as Affymetrix’s GeneChip Human Mapping 500K array set have made it possible to genotype large samples in a fast and cheap manner. A lot of algorithms were developed to call the genotypes from SNP array. When considering the low level preprocessing of SNP array, most algorithms just borrow the techniques from the gene expression microarray. As i...

متن کامل

Look who is calling: a comparison of genotype calling algorithms

In genome-wide association studies, high-level statistical analyses rely on the validity of the called genotypes, and different genotype calling algorithms (GCAs) have been proposed. We compared the GCAs Bayesian robust linear modeling using Mahalanobis distance (BRLMM), Chiamo++, and JAPL using the autosomal single-nucleotide polymorphisms (SNPs) from the 500 k Affymetrix Array Set data of the...

متن کامل

[Genome-wide detection of loss of heterozygosity and copy number variation in a human lung large cell carcinoma cell line by affymetrix single-nucleotide polymorphism array 500K.].

BACKGROUND Loss of heterozygosity (LOH) and Copy number copy number variation (CNV) of DNA sequences is a common feature of cancer genomes, which is thought to be linked to tumorigenesis and progression. High-density singlenucleotide polymorphism (SNP) genotyping array are able to provided a genotype and copy number information with genome-wide coverage, which is suitable for the analysis of co...

متن کامل

SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays

MOTIVATION The technology to genotype single nucleotide polymorphisms (SNPs) at extremely high densities provides for hypothesis-free genome-wide scans for common polymorphisms associated with complex disease. However, we find that some errors introduced by commonly employed genotyping algorithms may lead to inflation of false associations between markers and phenotype. RESULTS We have develo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006